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Abstract. We consider an extension of the popular matching problem 
in this paper. The input to the popular matching problem is a bipartite 
graph G = (AuB, E), where A is a set of people, B is a set of items, and 
each person a £ A ranks a subset of items in an order of preference, with 
ties allowed. The popular matching problem seeks to compute a matching 
M* between people and items such that there is no matching M where 
more people are happier with M than with M* . Such a matching M* 
is called a popular matching. However, there are simple instances where 
no popular matching exists. 

Here we consider the following natural extension to the above problem: 
associated with each item b G B is a non-negative price cost(i>), that 
is, for any item b, new copies of b can be added to the input graph by 
paying an amount of cost (6) per copy. When G does not admit a popular 
matching, the problem is to "augment" G at minimum cost such that 
the new graph admits a popular matching. We show that this problem 
is NP-hard; in fact, it is NP-hard to approximate it within a factor of 
^/ni/2, where m is the number of people. This problem has a simple 
polynomial time algorithm when each person has a preference list of 
length at most 2. However, if we consider the problem of constructing a 
graph at minimum cost that admits a popular matching that matches all 
people, then even with preference lists of length 2, the problem becomes 
NP-hard. On the other hand, when the number of copies of each item is 
fixed, we show that the problem of computing a minimum cost popular 
matching or deciding that no popular matching exists can be solved in 
O(mni) time, where m is the number of edges. 



1 Introduction 



The popular matching problem deals with matching people to items, where each 
person ranks a subset of items in an order of preference, with ties allowed. The 
input is a bipartite graph G = (AUB,E) where A is the set of people, B is the 
set of items and the edge set E = E\ U • • • U E r (E{ is the set of edges of rank 
i). For any a € A, we say a prefers item b to item b' if the rank of edge (a, b) is 
smaller than the rank of edge (a, b 1 ). If the ranks of (a, b) and (a, b') are the same, 
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then a is indifferent between b and b' . The goal is to match people with items 
in an optimal manner, where the definition of optimality will be a function of 
the preferences expressed by the elements of A. The problem of computing such 
an optimal matching is a well studied problem and several notions of optimality 
have been considered so far; for instance, pareto-optimality pQ, rank-maximality 
[1], and fairness. 

One criterion that does not use the absolute values of the ranks is the notion 
of popularity. Let M(a) denote the item to which a person a is matched in a 
matching M. We say that a person a prefers matching M to M' if (i) a is 
matched in M and unmatched in M', or (ii) a is matched in both M and M', 
and a prefers M(a) to M'(a). 

Definition 1. M is more popular than M' , denoted by M y M' , if the number 
of people who prefer M to M' is higher than those that prefer M' to M . A 
matching M* is popular if there is no matching that is more popular than M* . 

Popular matchings were first introduced by Gardenfors [3] in the context of 
stable matchings for two-sided preference lists (here both sides of the graph G 
express preferences). In the domain of one-sided preference lists, popular match- 
ings can be considered to be stable as no majority vote of people can force 
a migration to another matching; also, popularity is based on relative ranking 
rather than the actual ranks used by people. These properties make popularity 
a desirable notion of optimality in the domain of one-sided preferences. 

On the flip side, popularity does not provide a complete answer since there 
exist simple instances that do not admit any popular matching. An example is 
the following: let A = {ai, 02, 03}, B = {61, 62, 63}, and each person Oj, for i = 
1,2,3, prefers b\ to 62, and 62 to 63. Consider the three symmetrical matchings 
Mi = {(01,61), (02,62), (03,63)}, M 2 = {(ai,& 3 ), (a 2 ,6i), (a 3 ,o 2 )} and M 3 = 
{(oi, 62), (02, 63), (03, 61)}. None of these matchings is popular, since Mi -< M2, 
Mi -< M3, and M3 -< My. Abraham et al. [2] designed efficient algorithms for 
determining if a given instance admits a popular matching and computing one, 
if it exists. 

The fact that popular matchings do not always exist has motivated several 
extensions to the popular matching problem. McCutchen [TU] considered the 
problem of computing a least unpopular matching; he considered two measures 
of unpopularity and showed that computing a matching that minimized either of 
these measures is NP-hard. Kavitha et al. [5] generalized the notion of popularity 
to mixed matchings or probability distributions over matchings and showed that 
a popular mixed matching always exists. 

Our problem. Here we consider another natural generalization to the popular 
matching problem: augment the input graph G such that the new graph admits 
a popular matching. Our input consists of G — [A U B, E) and a function cost : 
B — > R + , where cost (6) for any b £ B is the cost of making a new copy of item 
b. The set B is a set of items, say books or DVDs, and new copies of any b £ B 
can be obtained by paying cost(6) for each new copy of b. There is no restriction 
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on the number of copies of any item that can be made. The only criterion that 
we seek to optimize is the total cost of augmenting G. 

Going to back to the earlier example on 3 people and 3 items that did not 
admit a popular matching, it is easy to show that by making a new copy of either 
b\ or 62, the resulting graph admits a popular matching. In order to minimize 
the cost, we will make a new copy of that item in {61, & 2 } which has lower cost. 
Our starting graph G = (A U B, E) comes for free, every addition that we make 
to G comes at a price and our goal is to make these additions such that the new 
graph admits a popular matching and the total cost of additions is minimized. 
We call this the min-cost augmentation problem. 

A related problem. A related problem is the following: we do not have a 
starting graph G. We are given a set A of people and their preference lists over 
a universe U of items where each item b € U has a price cost(fr) > associated 
with it. The problem is to "construct" an input graph G = (AUB,E) where B 
is a multiset of some elements in U such that G admits a popular matching and 
the cost of constructing G, that is, J2beB cos t(fr), is as small as possible. Here we 
also have an extra condition that the popular matching should leave no person 
unmatched, otherwise we have a trivial solution of B = 0. We call this problem 
the min-cost popular instance problem. 

The above problem can also be regarded as a "gift buying" problem. Each 
person in A has a preference list over gifts that she would like to receive. The 
problem is to buy a gift for each person in A with the total cost as small as 
possible and assign each person a gift such that this assignment is popular. That 
is, there is no reassignment of gifts that makes more people happier. This is the 
min-cost popular instance problem. 

1.1 Our Results 

We show the following results in this paper: 

1. The min-cost popular instance problem is NP-hard, even when each prefer- 
ence list has length at most 2 (i.e., every person has a top choice item and 
possibly, a second choice item). 

2. The min-cost augmentation problem has a polynomial time algorithm when 
each preference list has length at most 2 . 

3. The min-cost augmentation problem is NP-hard for general lists. In fact, 
it is NP-hard to approximate to within a factor of y / nT/2, where m is the 
number of people. 

All our NP-hardness results hold even when preference lists are derived from 
a master list. A master list is a total ordering of the items according to some 
global objective criterion. Thus if b\ precedes 62 in the master list and if a person 
a has both 61 and & 2 in her list, then it has to be the case that 61 precedes 62 in 
a's list. 
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We would like to contrast the NP-hardness of the min-cost augmentation 
problem with the problem of determining a popular matching with variable job 
capacities [7 ]. In this problem the input is a graph G = (AU B,E) where A is a 
set of people and B is a set of jobs, we are also given a list {c\ , . . . , c\b\) denoting 
upper bounds on the capacities of each job. The problem is to determine if 
there exists (x\, . . . , xigt) such that for each i, setting the capacity of the i-th 
job to Xi, where 1 < Xi < c^, enables the resulting graph to admit a popular 
matching. This problem was shown to be NP-hard in [7J ; however note that here 
the capacity of each job has an upper bound. Instead, if we only had to maintain 
an overall upper bound on the total increase of capacities rather than individual 
upper bounds, a simple polynomial time algorithm solves this problem [7]. 

In the min-cost augmentation problem recall that there is no upper bound on 
the amount that we can spend on a particular item. What we seek to optimize 
is the overall cost and this problem is NP-hard. Note that when each item has 
the same cost, then this problem can be solved in polynomial time (using the 
above algorithm from [7J). However when the costs come from {1, 2} the problem 
becomes NP-hard. 

The NP-hardness results for the min-cost augmentation/min-cost popular 
instance problems are because the number of copies of each of the items need to 
be determined so as to ensure the existence of a popular matching at minimum 
cost. Let copies(&) for any item b £ B denote the number of copies of item b in 
our graph G. We now consider the following problem: each b G B has a fixed 
number of copies denoted by copies(fr) and let the cost of a matching M be the 
sum of costs of items that are matched in M (we have to pay a cost of /c.cost(fe) 
if k copies of item b are used in M, where k < copies(6)). Our final result is a 
polynomial time algorithm for the min-cost popular matching problem which we 
define below. 



The min-cost popular matching problem is to determine if G admits a pop- 
ular matching or not and if so, to compute the one with minimum cost. 
We show that this problem can be solved in 0{mn\) time, where m is the 
number of edges and n\ is the number of people. Manlove and Sng consid- 
ered this problem without costs in the context of House Allocation. There 
items were called houses and copies of items as in our case were represented 
using capacities for houses. They called it Capacitated House Allocation 
with Ties (CHAT) and the problem was to determine if G admits a popular 
matching or not, and if so, to compute one. Manlove and Sng :9] showed an 
0((t/C + ni)m) algorithm for the CHAT problem, where C is the sum of 
capacities of all items. Thus, our algorithm improves upon the algorithm in 



1.2 Background 

Abraham et al. [2] considered the problem of determining if a given graph G — 
(A U B,E) admits a popular matching or not, and if so, computing one. They 
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also gave a structural characterization of graphs that admit popular matchings. 
Section [2] outlines this characterization and the algorithm that follows from it. 

Subsequent to this work, several variants of the popular matchings problem 
have been considered. One line of research has been on generalizations of the 
popular matchings problem while the other direction has been to deal with in- 
stances that do not admit any popular matchings. The generalizations include 
the capacitated version studied by Manlove and Sng [5], the weighted version 
studied by Mestre [H] and random popular matchings studied by Mahdian [5]. 
Kavitha and Nasre [5] as well as McDermind and Irving [11] independently stud- 
ied the problem of computing an optimal popular matching for strict instances 
where the notion of optimality is specified as a part of the input. Note that they 
also considered the min-cost popular matchings but in this version the costs are 
associated with edges whereas in our problem costs are associated with items. 
As described earlier, the line of research pursued for instances that do not admit 
popular matchings includes the NP-hardness of least unpopular matchings [TU] , 
the existence and algorithms for popular mixed matchings [5 and NP-hardness 
of the popular matchings problem with variable job capacities [7] . 

Organization of the paper. Section[5]discusses preliminaries. Section[3]shows that 
the min-cost popular instance problem is NP-hard. Section U has our results for 
the min-cost augmentation problem and Section [5] has our algorithm for the 
min-cost popular matching problem. 

2 Preliminaries 

We review the characterization of popular matchings given in [2]. Let G\ — 
{A U B,E\) be the graph containing only rank-1 edges. Then [3J Lemma 3.1] 
shows that a matching M is popular in G only if MnE\ is a maximum matching 
of G\. Maximum matchings have the following important properties, which we 
use throughout the rest of the paper. 

MPiEi defines a partition of AUB into three disjoint sets: a vertex u G AUB 
is even (resp. odd) if there is an even (resp. odd) length alternating path in G\ 
(w.r.t. Mflfii) from an unmatched vertex to u. Similarly, a vertex u is unreach- 
able if there is no alternating path from an unmatched vertex to u. Denote by 
£, O and U the sets of even, odd, and unreachable vertices, respectively. 

Lemma 1 (Gallai-Edmonds Decomposition). Let£, O and U be the sets 
of vertices defined by G\ and M l~l E\ above. Then 

(a) £,0 andlA are pairwise disjoint, and independent of the maximum matching 

Mr\E x . 

(b ) In any maximum matching of G\ , every vertex in O is matched with a vertex 
in £, and every vertex in U is matched with another vertex in U. The size 
of a maximum matching is \0\ + \U\/2. 

(c) No maximum matching of G\ contains an edge between a vertex in O and 
a vertex in OUU. Also, G\ contains no edge between a vertex in £ and a 
vertex in £ UU. 
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Since every maximum cardinality matching in G\ matches all vertices u £ 
OUW, these vertices are called critical as opposed to vertices u £ £ which are 
called non-critical. Using this partition of vertices, the following definitions can 
be made. 

Definition 2. For each a £ A, define f(a) to be the set of top choice items for 
a. Define s(a) to be the set of a's most-preferred non-critical items in G\. 

Theorem 1 (from |2j). A matching M is popular in G iff (i) M fl E\ is a 

maximum matching of Gi = (A U B,Ei), and (ii) for each person a, M(a) £ 
/(o)Us(a). 

The algorithm for solving the popular matching problem is now straightforward: 
each a £ A determines the sets /(a) and s(a). A matching that is maximum in 
G\ and that matches each a to an item in f(a) U s(a) needs to be determined. 
If no such matching exists, then G does not admit a popular matching. 

3 Min-cost Popular Instance 

In this section we consider the min-cost popular instance problem. Our input is 
a set A of people where each a £ A has a preference list over items in a universe 
U, where each item b £ U has a price cost (6) > 0. The problem is to "construct" 
a graph G or equivalently, set suitable values for copies(fr) where b £ U, in order 
to ensure that the resulting graph G admits a popular matching that matches 
all a £ A, at the least possible cost. 

We will show that the above problem is NP-hard by showing a reduction 
from the monotone l-in-3 SAT problem to this problem. The monotone 1-in- 
3 SAT problem is a variant of the 3SAT problem where each clause contains 
exactly 3 literals and no literal appears in negated form. The monotone l-in-3 
SAT problem asks if there exists a satisfying assignment to the variables such 
that each clause has exactly 1 literal set to true. This problem is NP-hard [13] . 

Let I be an instance of the monotone l-in-3 SAT problem. Let C\ , . . . , C m 
be the clauses in 1 and let X\, . . , , X n be the variables in I. We construct from 
I an instance of the min-cost popular instance problem as follows: 

Corresponding to each clause d — (Xj 1 V Xj 2 V Xj 3 ), we have 9 people 
Ai = {a\ . . . , ag}. Their preference lists are shown in Fig. [TJ In this case every 
person has a preference list of length 2, that is a top item followed by a second 
choice item. For instance, a\ treats item Uj l 9-S its rank-1 item and item as 
its rank-2 item. 

The items called public items and the items p\,p\,p\, and 

q 1 are called internal items. The internal items appear only on the preference 
lists of the people of Ai while the public items also appear on the preference 
lists of people in Aj as well as outside Ai . The public item Uj corresponds to the 
variable Xj. In every clause Ci that Xj belongs to, the item Uj appears in the 
preference lists of some of the people in the set Ai as shown in Fig. Q] 
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Fig. 1. The preference lists of people corresponding to the i-th clause in I. 

The set A of people in our instance is UiAi. The universe U of all items is 
the union of {iti, . . . , u n } (the n public items) and the set Ui{p\,p l 2,Pz, °f au 
the internal items. It remains to describe the costs of the items. For each i, the 
cost of each p\ for t — 1, 2, 3, is 1 unit, while the cost of q l is zero units. The cost 
of each uj, for j = 1, . . . , n, is 3 units. 

Recall that our problem is to determine a set B of items with suitable copies 
so that the graph (A U B, E) admits a popular matching that matches all a e A 
and we want to do this at the least possible cost. We first show the following 
lemma. 

Lemma 2. Any instance (AJB, E) that admits a popular matching that matches 
all a S A has cost at least lAm. 

Proof. Let us focus on the set Ai of people corresponding to clause C;. The 
preference lists of people in Ai are shown in Fig. [TJ Since the cost of each item 
on the lists of a\,a l 2 ,a l 3 is 3, we have to spend 9 units to buy an item each for 
these 3 people (since we seek an instance where all the persons get matched). 
People a\ , a\ , a\ have a unit cost item in their preference lists (items p\ , p\ , p\ , 
respectively). Thus, we have to spend 3 units to buy an item each for these 

3 people. Finally, a l 7 ,a z s ,a l 9 have a cost item, i.e. q l , in their preference lists. 
Hence, we can get q l with cop\es(q l ) = 3 for a cost of 0. Summarizing, we need 
to spend at least 9 + 3 + = 12 units for the people in Ai. 

However, it is not possible to spend just 12 units for the people in Ai. This is 
because, in the first place, we are forced to have non-zero copies for at least 2 of 
the 3 items in {uj 1 , Uj 2 , Uj 3 } and if we seek to match a\ to p\ while Uj 1 is around, 
then p\ is a|'s second choice item. Since p\ is a^'s top choice item, we also have 
to match a l 7 to p\ since a popular matching has to be a maximum cardinality 
matching on rank-1 edges (see Theorem [T]). Thus, it is not possible to match 
a\ to q % in a popular matching while p\ gets matched to a\ who regards this 
item as a second choice item because Uj 1 is around0 Thus we have the following 
options: 

(i) set copies(itj 1 ) — and then match a\ to p\ and a\ to q 1 

(ii) match both a\ and a\ to p\ by setting copies^) = 2 

(iii) add one more copy of Uj 1 and set copies^) ~ and thus match a\ to uj 1 
and a\ to q l 

4 A matching that contains the 3 edges (ai, UjA, (a\,p\), (a\, q 1 ) cannot be popular 
since by promoting a\ from q 1 to p\ , and a\ from p\ to Uj 1 , and leaving a\ unmatched, 
we get a more popular matching. 
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It is not possible to have option (i) for all the Uj 's since we are forced to have 
non-zero copies for at least 2 of the 3 items in {uj 1 ,Uj 2 ,Uj 3 \. Note that (ii) is 
always better than (iii) (cost of 2 units vs cost of 3 units). Hence it is always 
cheaper to match a\,a\,a,Q to p\,p\,p\ respectively than to any of Uj 1 ,Uj 2 ,Uj 3 . 

Thus, when we match a\,a l 5 ,a l 6 to p\,P2,P3 respectively, at least 2 of these 
3 people are getting matched to their second choice items. Hence, at least 2 out 
of the 3 people among a\, a l s , a l 9 will also have to be matched to their top choice 
items in order to ensure that the resulting matching is popular. This implies a 
cost of at least 9 + 3 + 2 = 14 for A t . 

This holds for each Ai, where 1 < i < m. Since the cost is at least 14 per 
clause, it amounts to at least 14m in total for all the clauses. □ 

The following lemma establishes the correspondence between the instance X 
of monotone l-in-3-SAT and the min-cost popular instance that we defined. 

Lemma 3. There exists an instance (A U B, E) with cost 14m that admits a 
popular matching that matches all a g A iff there exists a l-in-3 satisfying 
assignment for X. 

Proof. We know from Lemma [2] that any instance (A U B, E) that admits a 
popular matching that matches all a S A has a cost of at least 14m. What we 
need to show here is that (A U B, E) has cost 14m if and only if the l-in-3-SAT 
instance I is a "yes" instance, that is, there is a true/false assignment to the 
variables X\ , . . . , X n such that each clause has exactly 1 literal set to true (and 
thus 2 literals set to false). 

Suppose X admits such an assignment. We now show how to construct a set 
B of cost 14m such that the instance (AUB, E) admits a popular matching that 
matches all a G A. If Xj = true then set copies(iti) = 0, else copies(iti) will be 
set to a suitable strictly positive value. 

Since the setting of true/false values to X^s is a satisfying assignment, every 
clause has two literals set to false and 1 set to true. Let clause Ci be (Xj 1 V 
Xj 2 V Xj 3 ). Thus there is 1 variable Xj k in {Xj 1 ,Xj 2 ,Xj 3 } that has been set 
to true. By our definition of copies of every item, the corresponding Uj k has 
copies. Hence the people in the set Ai can be matched as follows: 

— a\,a 2 ,a\ get matched to the 2 items in {uj t , Uj 2 , Uj 3 } \ {uj k } by having 2 
copies of one of the lower indexed item and 1 copy of the higher indexed 
item for these 3 people. 

— p l k becomes a k+3 's top choice item (since Uj k does not exist in the graph 
now) and hence we can now match a k+3 to p\ and a k+6 to q l . 

This way we spend only 9 + 3 + 2 = 14 units for the people in Ai and each 
person a has an item in f(a) U s(a) to be matched to. Since every clause in X 
has exactly 1 variable set to true and 2 set to false, we achieve a cost of 14 for 
each set Ai. This shows that we can construct a set B of cost 14m such that 
(A U £>, E) admits a popular matching that matches all a £ A. 

To show the other direction, let us set the true/false values of variables in X 
as follows: for each j = 1, . . . , n set Xj — true if and only if copies(itj) = 0. We 
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need to show that such an assignment sets exactly 1 variable in each clause to 
be true. 

Let us consider any clause Cj = {Xj 1 V Xj 2 V Xj 3 ). Among the 3 items 
Uj 1 , Uj 2 , uj 3 that correspond to these 3 variables we need at least 2 items to have 
non-zero copies so as to match all the 3 people a\,a\,a\. Thus, our true/false 
assignment does not set more than 1 variable per clause to true. 

We now need to show that there is at least 1 item in {uj 1 , Uj 2 , Uj 3 } with zero 
copies. This is where we will use the hypothesis that we can construct (AUB, E) 
of cost 14m that admits a popular matching that matches all a E A. It follows 
from the proof of Lemma [2] that each set Ai of people corresponding to a clause 
needs a cost of at least 14. Since the total cost is only 14m and there are m 
clauses, this implies that we have to spend exactly 14 per clause. In other words, 
the items for the 9 people of each Ai have to be bought using only 14 units. 

If all the 3 items in {uj 1 , Uj 2 , Uj 3 } have non-zero copies, then this implies the 
cost of items for all the 9 people in Ai will be 9 + 3 + 3 = 15 since when each 
uj k has at least one copy, then the Uj^s become top choice items for a\,a\,a l & , 
respectively and thus p\,P2,p\ become their second choice items. This forces us 
to match each of a\,a\,a\ to their top choice items (that is, p\,p\,p\, respec- 
tively) since a popular matching has to be a maximum cardinality matching on 
rank-1 edges. However, we are given that we can spend only 14 units per A%; 
thus it has to be the case that there exists at least 1 item in {uj 1 , Uj 2 , Uj 3 } which 
has zero copies. This finishes the proof of this lemma. □ 

Note that the preference lists of all the people in our instance G are strict 
and of length at most 2. Also, the preference lists are drawn from a master list. 
We have thus shown the following theorem. 

Theorem 2. The min-cost popular instance problem is NP-hard, even when 
each preference list has length at most 2. Further, the hardness holds even when 
the preference lists are derived from a master list. 

4 Min-cost Augmentation 

In this section we show various results for the min-cost augmentation problem. 
Recall that the input here is a graph G = (A U B, E) where each item b G B 
has a non-negative cost(fr) associated with it. The problem is to determine how 
to make extra copies of items in B so that the resulting graph admits a popular 
matching and the cost of the extra copies is minimized. 

Unlike the min-cost popular instance problem, the above problem admits a 
simple polynomial time algorithm when each a € A has a preference list that 
is strict and of length at most 2. We describe this algorithm below. We assume 
throughout this section that we add at the end of each a's preference list a 
dummy item called the last item t a , where a being matched to l a amounts to a 
being left unmatched. 
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4.1 Preference lists of length 2 



For any a £ A, a's preference list consists of a top choice item (let us use f a to 
denote this item), and possibly a second choice item (let us use z a to denote this 
item) and then of course, the last item l a that we added for convenience. Let 
G\ be the graph G restricted to rank-1 edges. Let the graph G' = {A U B, £"), 
where E' consists of 

— all the top ranked edges (a, f a ): one such edge for each a £ A, and 

— the edges (a,s a ), where a is even in G\ and s a is a's most preferred item 
that is even in G\. Thus s a — z a when z a is nobody's top choice item, else 

It follows from Theorem Q] that G admits a popular matching if and only if 
G' admits an ^4-perfect matching. We assume that G does not admit a popular 
matching and we have to decide now which items should be duplicated and how 
many extra copies should be made. Since G' does not admit a popular matching, 
there exists a set S of people such that the neighborhood N(S) of S in G' satisfies 
|-/V(5)| < \S\. Let S denote a minimal such set of people. It is easy to see that 
every a £ S must be even in G\. Thus, for each a £ S 1 , the edge (a, s a ) belongs 
to G' and it must be that s a = z a . Otherwise s a — £ a and since no vertex in 
A other than a has an edge to £ a , such an a will be always matched in any 
maximum cardinality matching in G' . Hence, such an a cannot belong to S due 
to its minimality. Further note that for any such minimal set S, the set N(S) 
is a set of items that are all odd in the graph G' with respect to a maximum 
cardinality matching in G' . 

Since s a — z a for every a £ S, and the preference lists are of length at most 
2, there are no items sandwiched between f(a) and s(a) in a's preference list 
for every a £ S. Thus, in order to ensure that these people get matched in any 
popular matching, we need to make extra copies of items in N(S) or equivalently 
items that are odd in the graph G' . Our algorithm precisely does this and in order 
to get a min-cost augmentation, it chooses the odd item in G' which has least 
cost. The steps of our algorithm are described in Algorithm [1] 

Our algorithm maintains the invariant that no person a changes her s-item 
due to the increase in copies. This is because we ensure that no top choice item 
b ever becomes even in H}, the graph Hi restricted to rank-1 edges. Note that 
the set of odd items in Hi is identified by constructing alternating paths from a 
person who is unmatched in Hi and every item b that appears on such a path is 
always odd. Further, our duplications ensure that the total number of copies of 
an item 6 in any augmented instance Hi is bounded by the degree of 6 in G' . Li- 
the case of a top choice item 6, the degree of b in G' is equal to the degree of b 
in Gi, the graph G restricted to rank-1 edges. Thus, even with the extra copies, 
a top choice item remains critical in the augmented graph restricted to rank-1 
edges. This implies that for every person, the most preferred even item in the 
augmented graph restricted to rank-1 edges (i.e., its s-item) remains unchanged. 

We note that the above claim also implies that in every iteration of the while 
loop in Step 4 of our algorithm, the size of the maximum cardinality matching 
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Algorithm 1 Min-cost augmentation for strict lists of length at most 2. 

1: Construct the graph G' = (AllB, E') where E' = {(a, b) :a€A,b£ f(a) U s(a)}. 
2: Ho = G, H' = G' , Let Mo denote a maximum cardinality matching in H' . 
3: i = 0. 

4: while Mi is not ,4-complete matching do 

5: Partition the vertices into odd (O), even (£), unreachable (U) w.r.t. Mi. 
6: Let b denote the cheapest item in B PI O. 
7: Set copies(fc) = copies(fr) + 1. 

8: Construct the graph H' i+1 corresponding to Hi+i and update Mi+i to be a 
maximum cardinality matching in H' i+1 . 

9: i = i + l. 
10: end while 
11: Output the graph Hi. 



increases by 1, that is, |-Mj_j_i| = \Mi\ + 1. Therefore, the while loop terminates 
in k = \A\ — \Mo\ iterations. Since k is bounded by m, the number of applicants 
in G, the running time of our algorithm is 0(n\). It is clear that the graph Hi 
returned by the algorithm admits an ,4-complete matching the graph H[ and 
hence admits a popular matching. To see that the instance returned is a min-cost 
instance, observe that there is no alternating path between an item b which got 
duplicated in our algorithm and an item b' whose cost is strictly smaller than 
the cost(&). Otherwise in the iteration when b was odd, so was b' and it would 
have been picked up by our algorithm. 

We can therefore conclude the following theorem. 

Theorem 3. The min-cost augmentation problem with strict preference lists of 
length at most 2 can be solved in 0(n\) time. 

4.2 Hardness for the general case 

We now show that the min-cost augmentation problem in the general case is 
NP-hard. The reduction is again from the monotone l-in-3 SAT problem (refer 
to Section [3]) . Let X be an instance of the monotone l-in-3 SAT problem. Let 
C\ , . . . , C m be the clauses in I and let Xi, . . . , X n be the variables in X. We 
construct from X an instance of the min-cost augmentation problem as follows. 

Let d be (Xj 1 V Xj 2 V Xj 3 ). Corresponding to this clause we have 6 people 
Ai = {a\, a\, a\, a\, a\, Oq} and 3 internal items Di = {pi, q^Ti}. In addition we 
have public items Uj x , Uj 2 , Uj 3 which belong to preference lists of people in Ai 
and whenever Xj occurs in a clause Cj , the item Uj will belong to the preference 
lists of some people in Ai . The public items have unit cost whereas each internal 
item b E Di has cost 2. The preference lists of the people in Ai are shown in 
Fig.H 

The set B of items is the union of U™ x Di (the set of all the internal items) 
and {u\,...,u n } (consisting of all the public items, where vertex uj corre- 
sponds to the j-th variable Xj). The set A of people is the union of U^ 1 A i 
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Fig. 2. Preference lists of the 6 people in A± 



and {xi, . . . , x„}, where the vertex Xj corresponds to the variable Xj. The pref- 
erence list of each Xj is of length 1, it consists of the item Uj. 

G has no popular matching. It is easy to see that the graph G described 
above does not admit any popular matching. To see this, hrst note that each 
public item Uj is a unique rank-1 item for exactly one applicant Xj. Hence when 
every item has a single copy, these public items are unreachable or critical in G\ 
(the subgraph of rank-1 edges in G). Now let us consider the people in Af. for 
each a\ <G {a\,a 2 ,a l 3 }, we have f(a\) = {pi} and s(a\) = {qi}. Since there are 
only 2 items Pi,qi for the 3 people a\,a 2 ,a 3 to be matched to in any popular 
matching, G does not admit a popular matching. 

Let G be a min-cost instance such that G admits a popular matching. We 
now state the following lemma that establishes the reduction. 

Lemma 4. G has cost at most m iff there exists a l-in-3 satisfying assignment 
for the instance I. 

Proof. Assume that there exists a l-in-3 satisfying assignment for I. For each j, 
let Cj denote the number of clauses in which Xj appears. We will set the copies 
of the items in the following manner: the copies of the internal items remain the 
same, i.e., copies(fe) = 1 for each b G Uj-Di and the copies of the public items are 
set as follows. 

For each j, where 1 < j < n do: 

• if Xj = true, then set copies(uj) = 1 + c j 

• else copies(wj) remains 1. 

Let us determine the cost of this augmentation. For every Xj that is true, 
we pay a cost of Cj ■ 1 = Cj and for Xj that is false, we pay nothing. Since each 
clause has exactly one variable set to true, we have: J2j-x =true c i ~ m - Thus 
the cost of our augmentation is m. 

We now show that the graph G' admits an ,4-complete matching (the edges 
in G' are (a, b) where b e f(a) U s(a)). 

— Consider the people xi, . . . , x n . Each Xj gets matched to her /-item Uj. 

— Consider the people in A{. We know that exactly one amongst Uj 1 ,Uj 2 ,Uj 3 
has more than one copies (since the number of copies was based on a satis- 
fying assignment for l-in-3 SAT). If cop\es(uj k ) > 1, then a\ gets matched 
to Uj k and the 2 people in {a\, a\, a\} \ {a l k } get matched to pi and qi. Fi- 
nally, a l k+3 gets matched to her top choice item whereas the 2 people in 
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{a\,a\,a\} \ {a l k+3 } get matched to their last items (their most preferred 
even item in G\). 

To prove the other direction, assume that the cost of G is m. We now translate 
this into truth values for variables in X. If copies(uj) > 1 in G, then set variable 
Xj = true, else set Xj = false. We need to show that this is a l-in-3 satisfying 
assignment for X. 

Since the cost of adding one copy of any item is at least 1, we need to pay 
at least 1 unit per clause in order to match the people in Ai. Thus, we need 
to pay at least m to get a graph that admits a popular matching. However, we 
are given that with a cost of exactly m, the graph G that admits a popular 
matching. Hence, the copies of items have been added such that exactly 1 unit 
has been spent per clause. 

Spending 1 unit has allowed all the people in Ai, for each i, to have enough 
items to match themselves to in G' . Consider the items that occur in the pref- 
erence lists of people in Ai (refer to Fig. [5]) . Since the cost of each internal item 
is 2 and we cannot afford a cost of 2 for any clause, it has to be the case that 
copies(u) > 1 for some u S {uj 1 ,Uj 2 ,Uj 3 }. Thus, we have at least 1 true variable 
per clause in X. 

We now have to show that there is exactly 1 true variable per clause in 
X. The point to note is that copies(w) > 1 for any public item u implies that 
u is non-critical in G\. This changes the most preferred even item in G\ for 
some people. That is, suppose k items in {uj t , uj 2 , uj 3 } have more than 1 copies. 
Then, we have k non-critical items in {uj 1 ,Uj 2 ,Uj 3 } and so we have k people 
in {a\,a\,a\} satisfying the following: a's most preferred even item in G\ is no 
longer the last resort item l a , it is now the non-critical public item that is second 
in a's preference list. 

Observe that one person m{a\,a\,a\} can be matched to her top choice item 
ri. However, to match the second person we need to spend another unit. In the 
first place, we have already spent 1 unit to add an extra copy of some uj k to match 
all the people in {a\, a\, a\} . With more than one item in {uj 1 ,Uj 2 ,Uj 3 } non- 
critical in G\ , we have pay at least 2 units for the people in Ai . This contradicts 
the fact that we spent exactly 1 unit for the people in Ai . Hence there is exactly 
1 true variable per clause in X. □ 

We can now conclude the following theorem. 

Theorem 4. The min-cost augmentation problem is NP-hard, even for strict 
lists of length at most 3. Further, the lists can be derived from a master list. 

4.3 Inapproximability of min-cost augmentation 

We extend the above reduction from X to show that this problem is NP-hard to 
approximate to within a factor of yJrT\j2, where n\ is the size of A. We construct 
a graph H on at most 4m 4 people that satisfies the following property: 
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(*) If I is a yes instance for l-in-3 SAT, then H can be augmented at a cost of 
m to admit a popular matching. If I is a no instance for l-in-3 SAT, then 
H needs a cost strictly greater than m 3 to admit a popular matching. 

We describe the construction of the graph H below. Recall that I has m clauses 
and corresponding to each clause Ci, we have a set A of people. The construction 
of H is as follows. Let us call the group of 3 people (a\, a\,a\) in Fig.[S]a triplet. 
Instead of having just one triplet in A> as was the case in the previous section, 
here we have many such triplets. In particular, we have m 3 + 1 such triplets. The 
preference list for one particular triplet {a\ t+1 , a\ t+2 , 034+3) is shown in Fig. [3] 



a 3t+l 


r* 


«% 




a 3t+2 


r* 






a 3t+3 


r* 


U J3 





Fig. 3. Preference lists of people corresponding to the t-th triplet. 

We now have 3 -I- 3(m 3 + 1) people in A;, namely a\,a 2 ,a\ and 3 people 
per triplet, for each of the m 3 + 1 triplets. Thus our overall instance H has 
m(3 -I- 3(m 3 + 1)) (the people in U,Aj), plus the n people in {xx, ■ ■ ■ , x n }. Since 
each clause has 3 variables, n < 3m. Thus we can bound ni, the number of 
people in H as: n\ < 3m + 9m < 4m for m > 3. 

Recall that for each j, the preference list of Xj is of length 1, which consists 
of only Uj. The costs of the items are as follows: the cost of each of the internal 
items, i.e., Pi,qi, and r* , for k — 1, . . . , m 3 + 1 is m 3 , and the cost of each Uj for 
j = 1, . . . , n is 1. We now show that the instance constructed as above satisfies 
the property (*). 

Lemma 5. If I is a yes instance for l-in-3 SAT, then H can be augmented at 
a cost of m to admit a popular matching. If T is a no instance for l-in-3 SAT, 
then H needs a cost strictly greater than m 3 to admit a popular matching. 

Proof. We first consider the case when T is an yes instance. The proof is similar 
to that of Lemma |H For each j, where 1 < j < n, do the following: if Xj — true, 
then set copies(uj) = 1 + Cj, where Cj is the number of clauses in which Xj is 
present. Else set copies(uj) = 1. The total cost involved here is Ylj : x =true c r 
Since each clause has exactly one variable set to true, we have: Ylj-.x-=true c i = 
m. Thus, the cost of our instance H is in. It is easy to show that the graph H 1 
admits an A-complete matching. 

— Consider the people x\, . . . , x n . Each Xj gets matched to her /-item Uj. 

— Consider the people in A- We know that exactly one amongst Uj 1 ,Uj 2 ,uj 3 
has more than one copies (since the number of copies was based on a satis- 
fying assignment for l-in-3 SAT). If copies(uj Jc ) > 1, then a\ gets matched 
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to uj k and the 2 people in {a\, a 2 , a\} \ {a % k } get matched to pi and qi. For 
each of the to 3 + 1 triplets that we have here, we do as follows. The per- 
son ag i+fc gets matched to her top choice item r\ whereas the 2 people in 
{«3t+i: a 3t+2>«3t+3} \ { a lt+k\ S et matched to their last items. 

This proves that H can be augmented at a cost of exactly m to admit a 
popular matching. 

We now prove the other direction, that is, if I is a no instance for l-in-3 SAT, 
then H needs a cost of at least to 3 + 1 to admit a popular matching. Suppose H 
can be augmented at a cost of at most to 3 to admit a popular matching. We will 
show that this translates to a l-in-3 satisfying assignment for X. Let H denote 
the augmented graph. Let us set the truth values of variables in I as follows. 
Set Xj = true iff copies(uj) in H is greater than 1. 

We have only to 3 units available to make extra copies so that people in each 
set Ai have items in H' to match themselves to. Recall that the cost of each 
internal item is m 3 . Hence it is easy to see that we cannot afford an extra copy 
of any internal item and thus at least one public item in {uj 1 ,Uj 2 ,Uj 3 } should 
have more than one copy to match all of a\,a 2 ,a\. Otherwise there are only 2 
items p % and q % for these 3 people to be matched to; since the first copies of 
uj 1 , Uj 2 , Uj 3 will be matched to Xj 1 , Xj 2 , Xj 3 , respectively. Thus, we have shown 
that at least one of Uj 1 , Uj 2 , Uj 3 has more than one copy. Hence in our assignment 
of truth values, there is at least 1 variable in each clause that is set to true. 

Suppose 2 or more of the items in {uj 1 , Uj 2 , Uj 3 } have more than one copy in 
H. We have two people in {a\,a 2 ,a l 3 } having their most preferred even item in 
Hi as an item in {uj x , Uj 2 , Uj 3 }. In addition, in each of the m 3 + 1 triplets, two 
people have their most preferred even item in {uj 1 ,Uj 2 ,Uj 3 }. Although, one of 
these 2 people from each triplet can be matched to her unique top choice item, 
we still need to spend m 3 + 1 for all the people in Ai to be matched to items 
in H' . This contradicts the hypothesis that H can be augmented a cost of at 
most m 3 into H. Hence for each i, there is exactly 1 item in {uj 1 , Uj 2 , Uj 3 } that 
has more than one copy in if. In other words, for each i, there is exactly 1 true 
variable in the i-th clause. Thus, our assignment is a l-in-3 satisfying assignment 
for X. a 

Now suppose that the min-cost augmentation problem admits a y/n~i/2 ap- 
proximation algorithm. Call this algorithm Algol. If I is a yes instance, then 
Algol has to return an augmentation of cost at most l/2.\/4m 4 .m = to 3 . If X 
is a no instance, then there is no augmentation of cost at most to 3 , so Algol 
returns an answer of cost greater than to 3 . Thus using Algol it is possible to 
determine whether X has a l-in-3 satisfying assignment or not, a contradiction. 
Hence we conclude the following theorem. 

Theorem 5. It is NP-hard to approximate the min-cost augmentation problem 
on G = (AU B, E) within y/\A\/2. 
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4.4 Perfect augmentation 

In this section we consider a variant of the min-cost augmentation problem where 
we are not content with only a popular matching, but insist on every applicant 
being matched to an item. Note that till now we allowed people to be matched 
to their last items which was equivalent to leaving them unmatched. We call 
a popular matching that matches all the people as a perfect popular matching 
and denote the problem as min-cost perfect augmentation. We show that this 
problem becomes NP-hard even when preference lists are strict and have length 
at most 2 in contrast to the min-cost augmentation problem which has a simple 
polynomial time algorithm for strict lists of length at most 2. 

The overall reduction is similar to the min-cost augmentation problem where 
we reduce from an instance I of monotone l-in-3 SAT problem. As earlier, 
corresponding to every variable Xj in I we have an applicant Xj in our instance. 
The preference list of applicant Xj contains only one item Uj. Let Cj = (Xj 1 V 
Xj 2 V Xj 3 ) be a clause in X. Corresponding to clause Ci we have a set of 6 
people Ai = {a\, . . . , a|} and a set of internal items Di = {pi, qi}. The internal 
items appear on preference lists of people only in Ai. In addition public items 
Uj 1 ,Uj 2 ,Uj 3 also appear on the preference lists of people in Ai. The preference 
lists of people are as shown in Fig. SJ 
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Fig. 4. Preference lists of the 6 people in Ai 



We define the sets A and B below. 

A = U™ 1 A l U{x 1 ,...,x n }. 
B = U? =1 D l U{u 1 ,...,u n }. 

The graph G — (Ai) B,E) with the preference lists as defined above is our 
instance G. It remains to describe the costs of items in G. The cost of every 
public item Uj is 1 whereas the costs of every internal item is m. With a single 
copy of every item, all public items are critical in the graph G±. Hence people 
a\, a\,a\ belonging to Ai treat their unique last resort items as their s- items. 
Further people a\,a\, a\ treat the internal item qi as their s-item. It is easy to 
verify that the graph G described above does not admit a popular matching. 

Let G be an augmented instance that admits a perfect popular matching. 
In any perfect popular matching in G every applicant Xj has to be matched to 
her /-item i.e. Uj. Now let us consider the people in Ai. Since the internal items 
are of cost m each and public items have cost 1, it is cheaper to make extra 
copies of public items. Further, recall that any popular matching has to be a 
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maximum cardinality matching on rank-1 edges. Therefore in order to match 
one of {a\, a\, a\} (say a\) to its s-item Uj lt we need to make 2 extra copies 
of Uj 1 thus making Uj 1 non-critical in G\. This ensures that applicant a\ gets 
matched to a copy of Uj 1 , her /-item. It is easy to see that in order to get a 
perfect popular matching, we need to make at least two public items non-critical 
in the graph G. Thus we spend an amount of at least 4 units per clause in any 
augmented instance G that admits a perfect popular matching. 

We now show the following lemma which proves the correctness of our re- 
duction. 

Lemma 6. The min-cost instance G corresponding to G that admits a perfect 
popular matching has cost 4m iff there exists a l-in-3 satisfying assignment for 
X. 

Proof. Assume that there exists a l-in-3 satisfying assignment for X and let 
Truthval denote the l-in-3 satisfying assignment for X. For every variable Xj, 
let Cj denote the number of clauses in which Xj appears. We set the copies of 
the items in G as follows: If Truthval(Xj) = false, then copies(uj) = 2cj + 1, 
otherwise copies(uj) = 1. Thus we pay 2cj units for the extra copies of item Uj 
whenever the corresponding variable Xj is set to false. 

Since X has a l-in-3 satisfying assignment, each clause has exactly two vari- 
ables set to false. We therefore have: 

2cj = 4m. 

j:Truthval(Xj) — false 

Thus the cost of our instance G is 4m. We now show that in the graph G, every 
applicant has an item to match amongst her / or s items. 

— Consider the people xi, x<i, . . . , x n . Each Xj gets matched to her /-item Uj. 

— Consider the people in Ai corresponding to the clause d — (Xj 1 VXj 2 VXj 3 ). 
We know that exactly two among Uj 1 , Uj 2 , Uj 3 are non-critical. Assume that 
Uj 1 and Uj 2 are non-critical, then people a\ and a\ get matched to their 
s-items Uj 1 and Uj 2 respectively. Further a\ and a\ get matched to their 
/-items Uj 1 and Uj 2 respectively. Finally, a\ gets matched to her /-item pi 
whereas a\ gets matched to her s-item g,. 

To prove the other direction, assume that G admits a popular matching and 
has cost 4m. We now translate this into the truth values for variables in X. If 
copies(wj) > 1 in G, then set variable Xj = false else set Xj = true. We need 
to show that this assignment is a l-in-3 satisfying assignment. 

Recall that for every clause we need to spend at least 4 units in order to 
match all the people in A^. Since the cost of G is 4m, this implies that exactly 
4 units have been spent per clause for making extra copies. 

Now consider the people in Ai. The internal items occurring on their prefer- 
ence lists of these people have cost m each, hence none of the internal items have 
extra copies in the augmented instance. This implies that at least 2 public items 
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amongst {uj 1 , Uj 2 , Uj 3 } have more than one copy in the augmented instance. 
Thus our truth assignment sets at least 2 variables to false. It remains to prove 
that for every clause exactly 2 variables are set to false. Assume not. 

Let Ci be a clause in which our truth assignment set all the three variables 
Xj x , Xj 2 , Xj 3 to false. This implies that in the instance G all the three public 
items Uj 1 , Uj 2 , tij 3 have more than one copy. The first copy of these public items 
is utilized in matching people Xj t , Xj 2 , Xj 3 respectively to their /-items. Further 
since any popular matching is a maximum cardinality matching on rank-1 edges, 
the second copies of these items will have to be assigned to people a\, a\, a\. Thus 
after spending 3 units we are still left with 2 people amongst {a\, a|, a|} without 
any item. Hence we need to spend 2 more units to match 2 of these people, which 
implies that we spent 3+2=5 units for people in A}. This contradicts the fact 
that exactly 4 units was spent in order to match people in Ai, for all i. Thus 
exactly one variable in every clause is set to true. □ 

The following theorem is immediate from the above lemma. 

Theorem 6. The min-cost perfect augmentation problem is NP-hard for strict 
lists of length at most 2. 

5 Min-cost Popular matchings 

In this section we present an 0(mni) time algorithm for the min-cost popular 
matchings problem, where m — \E\ and ri\ = \A\. Our input is an instance 
G = (A U B, E) where each item b £ B has associated with it the number 
copies(fe) (denoting the maximum number of people that can be matched to b) 
and a price cost(fe) > 0. Whenever a person gets matched to b, an amount of 
cost(6) has to be paid. Thus if k < copies(6) copies of b gets used in a matching 
M, then a cost of k ■ cost(6) has to be paid by M. As done in the earlier sections, 
we will add a last item l a at the end of a's preference list for each person a € A. 
The cost of l a is 0, since using the edge (a, £ a ) amounts to leaving a unmatched. 

Our problem here is to decide whether G admits a popular matching or not 
and if so, to compute the one with minimum cost. As mentioned in Section [1] 
Manlove and Sng considered the popular matchings problem (referred to as the 
CHAT problem) where items (these were called houses) have capacities and they 
showed an 0({-\/~C + ni)m) algorithm for this problem, where C is the sum of 
all the capacities. 

In order to solve the min-cost popular matchings problem, for each b £ B, 
we could make copies(6) copies of each vertex b, call them b\, . . . , & cop ie 5 (&), where 
each bt has the same neighborhood as the original vertex b. However, such a 
graph has too many vertices and edges, hence we will stick to the original graph 
G = (AU B,E) and simulate the larger graph in G itself. Note that a matching 
in G can contain up to copies(6) pairs (ai, b). It is easy to see that the structural 
characterization for popular matchings from [2 holds for our problem as well. 
That is, any popular matching in our graph G has to be a maximum cardinality 
matching on rank-1 edges and every person a has to be matched to an item in 
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f(a) U s(a). This is because by making copies(fe) many duplicates of every item b 
in G our problem becomes equivalent to the original popular matchings problem. 

5.1 Our algorithm 

Our algorithm to compute a min-cost popular matching can be broadly parti- 
tioned into two stages. In the hrst stage we build the graph G', i.e. the graph 
where every person adds edges to their / and s-items. Identifying s-items for 
people involves partitioning the vertices of G into odd, even and unreachable 
with respect to a maximum cardinality matching on rank-1 edges. We show how 
to efficiently do this by building Hungarian trees rooted at unmatched vertices. 
The second stage then computes a min-cost popular matching in the graph G' 
if one exists. 

The first stage. We first construct the graph G\ which is the graph G restricted 
to rank-1 edges. In order to find a maximum cardinality matching in the graph 
G\, we use Ford-Fulkerson max- flow algorithm. The following transformation 
from Gi into a flow network is based on the standard transformation from the 
bipartite matching problem to the maximum flow problem: 

— add a vertex s and an edge directed from s to each person a £ A with an 
edge capacity of 1 on this edge. 

— add a vertex t and an edge directed from each item b € B to t with an edge 
capacity of copies(&) on this edge. 

— direct every edge (a, b) of G from a to b and set an edge capacity of 1 for 
each such edge. 

Let F(Gi) denote the above graph. It is easy to see that a valid flow from s 
to t in the graph F(G\) can be translated to a matching in G\ in which every 
person is matched to at most 1 item and every item b is matched up to copies(fe) 
people. A maximum flow in F(G\) becomes a maximum cardinality matching 
in G\ . We compute a maximum cardinality matching Mq of G\ by computing a 
max-flow from s to t in F(G\). Using the matching M , our goal is to obtain a 
partition of AiiB into O (odd), £ (even) and U (unreachable). This can be done 
in C(number of edges) provided we create copies(fe) many duplicates of each item 
p and duplicate the neighborhood of b for each copy of b. However this is too 
expensive. The main point to note is that all the copies(fe) many copies of b, for 
each item b, have the same odd/even/unreachable status. We show below that 
we can remain in the graph G\ and determine the odd/ even /unreachable status 
of all the vertices in linear time. 

1. We begin with O = £ = U = 0. 

2. We then add to the set £ all people that are unmatched in Mq and all items 
that are not fully matched by Mo (i.e. a item b that is matched to fewer than 
copies(fe) many people). This is because if we would have made copies(6) many 
duplicates of b, some of the copies would have remained unmatched by Mq 
and the other copies which are matched would be connected by even length 
alternating paths from these unmatched vertices. 
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3. Our goal now is to build a Hungarian tree T u for each vertex u that is 
unmatched or not fully in Mq. In order to do so we first set all vertices as 
unmarked. We build the trees rooted at unmatched people and not fully 
matched items as described below: 

(a) For u & A that is unmatched, the children of u in T u are all the neigh- 
boring items of u that are unmarked so far. For each of these items b 
the children of b in T u are all the unmarked people matched to b. The 
children of these people are their neighboring unmarked items and so on. 
As soon as a vertex gets visited in T u we mark it. 

(b) For u G B the children of u are all the neighboring unmarked people of 
u. Note that some of these people could be matched to u - however, we 
will include all these people since we are simulating the Hungarian tree 
rooted at an unmatched copy of u. We mark each person in this child 
list. 

Each person a in the above child list had a unique child, the item to 
which a is matched. If this item is marked, then a is a leaf in this tree, 
else we add M (a) to the tree and mark it. We now continue to explore 
the unmarked neighborhood of Mo (a) for all non-leaf people a. 

(c) Once T u is built, all vertices that belong to even levels of T u (the root is 
at level 0) are added to £ and all vertices that belong to odd levels are 
added to O. 

4. Once we finish building all the trees T u , where u is unmatched person/not 
a fully matched item, the set U gets set to the vertices of AU B\0 U £ as 
there is no alternating path from an unmatched vertex to such vertices. 

We note that while building a tree T u , we explore the neighborhood of a 
vertex only if this vertex is unmarked and then this vertex immediately gets 
marked. This ensures that a vertex occurs just once across all T M 's. Having 
obtained the partition, it is now possible to define s(a) for every person a as the 
most preferred even item of a. Let the graph G' be the graph G\ along with the 
edges (a, b) where a G £ and p G s(a). 

Since a popular matching is a maximum cardinality matching on rank-1 
edges, all items that are critical in G\, that is, all items in O U U have to be 
fully matched in every popular matching M* of G. The only choice we have 
is in choosing which items of £ should participate with how many copies in 
the min-cost popular matching. We make this choice in the second stage of our 
algorithm. 

The second stage. Our goal in the second part of the algorithm is to augment 
the matching Mq to find a min-cost popular matching. However, we start with 
the matching Mi, where Mi = M \ {all edges (a, b) where a G O}. Thus Mi 
consists only of edges (a, b) where b G OUW. We take Mi to be our starting 
matching rather than Mq because it may be possible to match people O P\ A 
to cheaper rank-1 neighbors. Recall that while computing the max-flow Mo, the 
costs of items played no role. 
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Now let p be an augmenting path with respect to Mi, i.e., one end of p is an 
unmatched person and the other end of p is item b that is not fully matched. The 
cost of augmenting the current matching along p is the cost of b. By augmenting 
the current matching along p, every item other than b that is currently matched 
stays matched to the same number of people and the item b gets matched to one 
more person. Thus the cost of the new matching is the cost of the old matching + 
cost(6). In order to match an unmatched person a, our algorithm always chooses 
the cheapest augmenting path starting from the person a. 

To find the cheapest augmenting path we build a Hungarian tree T a rooted 
at every person that is unmatched in Mi . Initially all vertices are unmarked and 
while building T a every visited vertex gets marked so that each vertex occurs 
at most once in T a . We do not terminate the construction of T a as soon as we 
find an augmenting path, but we build T a completely in order to find a min-cost 
item b such that there is an augmenting path between and a and 6; we augment 
Mi along this path to obtain M-z. On the other hand if T a has no augmenting 
path then we quit and declare "G does not admit a popular matching" . 



Algorithm 2 Algorithm for min-cost popular matching. 
1: Construct the graph G\ = (All B, Ex) where Ei = {(a, b) : a € A, b € /(a)}. 
2: Construct the flow graph F(Gi) by adding two vertices s and t and adding directed 

edges with appropriate capacities. 
3: Compute a maximum flow in F(Gi) and translate the flow to a matching Mo in 
Gi. 

4: Obtain a partition of the vertices of G as odd (O) , even (£) and unreachable (U) 
using Mo. 

5: Construct the graph G' = (AuB, E') where every person adds edges to her /-items 

and every even person adds edges to her s-items. 
6: Delete from G' all OO and OU edges. 

7: Delete from Mo all edges that are incident on odd people in G' and call the resulting 

matching Mi. 
8: i = 1. 

9: while there exists an unmatched person a in Mi do 
10: Build a Hungarian tree T a rooted at a. 
11: if there exists no augmenting path starting at a then 
12: Quit and declare "G does not admit any popular matching". 

13: else 

14: Augment Mi along the cheapest augmenting path in T a and call the new 

matching Mj+i. 
15: end if 
16: i = i + l. 
17: end while 
18: Return Mi. 



We present our entire algorithm in Algorithm [21 To see the correctness of 
the algorithm we first note that if there is no augmenting path in T a , where a 
is an unmatched person in Mi, then there is no popular matching in G. This is 
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because every popular matching is a maximum cardinality matching on rank-1 
edges and has to match every a £ A to a item in f(a) Us (a). It remains to prove 
that if G admits a popular matching, then the matching M(= Mi) returned at 
Step 18 of Algorithm [5] is a min-cost popular matching. We prove that using 
Lemma [7] 

Lemma 7. If G admits a popular matching, then the matching M returned by 
our algorithm is a min-cost popular matching in G. 

Proof. Suppose M is not a min-cost popular matching in G and let OPT be such 
a matching. For the purpose of this proof we operate on the cloned graph where 
each item b has copies(6) many copies and M and OPT both refer to matchings 
where each item is matched to at most one person. Consider OPT M - this is 
a collection of cycles and even length alternating paths (since both OPT and M 
are ^4-complete). The cycles do not contribute to any change in costs since both 
OPT and M match the same items in any cycle. 

Let p be a path in OPT © M. Let /3o and /3m be the endpoints of this path, 
where OPT leaves /3m unmatched while M leaves /3q unmatched. It suffices to 
show that cost (/3m) < cost(/3o). Since OPT is a popular matching, it has to 
match all the items in O U U (the odd/unreachable items in G\). Since it leaves 
Pm unmatched, it follows that /3m € £ and thus there are items of £ in p. 

It is the second stage of our algorithm that matches items in £ . Let a.\ be the 
last person in the path p to get matched by our algorithm and let M(pt\) — f}\. 
Since /Jo is unmatched in M it implies that during the execution of our algorithm 
we found at least two augmenting paths from a\ - one ending in fi\ and the other 
ending in /3o- Further, we found the augmenting path ending in /3i cheaper, that 
is, cost(/3i) < cost(/3o). 

We now repeat the same argument for the (3i-/3m sub-path of p. Let ui be 
the last person in the /3\-/3m sub-path that got matched by our algorithm and 
let Mia?) = /?2- Note that /3i was also unmatched at this time and hence our 
algorithm found at least two augmenting paths from ai - one ending in fix and 
another ending in /3 2 . Since M{d2) = $i it implies that cost(/3 2 ) < cost(/3i). 

Repeating the same argument for the /?2 _ /3a/ sub-path we get vertices /J3, . . . , j3 t = 
Pm where cost(/3 2 ) < cost(/3 3 ) < • • • < cost(/3t). Combining all the inequalities 
yields cost (/3m) < cost(/3 ). □ 

Time complexity of this algorithm. The difference between our algorithm and 
that of Manlove and Sng for the CHAT problem in the first stage is that they 
use Gabow's algorithm to find a matching on rank-1 edges whereas we use Ford- 
Fulkerson max-flow algorithm. Gabow's algorithm runs in time 0(yCm) where 
C = z2l=i copies(&i) whereas since the value of max-flow in the graph F(Gi) is 
upper bounded by |^4| = m, Ford-Fulkerson algorithm takes 0{mn\) time. Also, 
the total time taken by our algorithm to partition vertices into 0,£ , and U is 
0(m + n) where n denotes the total number of vertices in G. It is easy to see 
that the time spent by our algorithm in the second stage is also 0(mm) since 
it takes 0(m) time to build the tree T a and there are at most n% such trees that 
we build. We can now conclude the following theorem. 
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Theorem 7. There exists an 0{mn\) time algorithm to decide whether a given 
instance G of the min-cost popular matchings problem admits a popular matching 
and if so, to compute one with min-cost. 

Note that by assigning a huge cost C > ^2 b copies(6).cost(&) to each of the 
last items t a ,a <G A that we introduced, our algorithm also works for the min- 
cost maximum-cardinality popular matching problem where we seek among all 
popular matchings of maximum cardinality, the one with minimum cost. 

Conclusions 

In this paper we considered several extensions of the popular matching problem. 
We showed that the min-cost popular instance problem, which involves building 
a min-cost graph that admits a popular matching that matches all applicants, 
is NP-hard, even when preference lists are strict and of length at most 2. In 
contrast, the min-cost augmentation problem admits a simple polynomial time 
algorithm when preference lists are strict and of length at most 2. However, 
the min-cost augmentation problem is NP-hard in general; it is NP-hard even 
when preference lists are strict and of length at most 3. In fact, it is NP-hard to 
approximate the min-cost augmentation problem to within a factor of y/ni/2, 
where m is the number of people. We also showed that the min-cost popular 
matching problem (the number of copies of each item is fixed here) can be solved 
in 0(mni) time, where m is the number of edges in the input graph. 
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